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ABSTRACT 



A system is provided for encoding clips of video data for 
multiplexing into a system level stream with associated 
audio and control data. By deriving a relationship between 
encoder and decoder buffer occupancy levels, and taking 
into account buffer fill rate, the multiplexer targets a starting 
occupancy for the video system layer buffer (the MPEG 
STD) at that for the decoders video buffer (the MPEG VB V). 
With knowledge of fill rate, the decoder buffer need only be 
filled to a predetermined level prior to reading out clips for 
decoding, rather than filling the buffer completely, and 
seamless joining of video clips can then be simply achieved. 
The technique has particular application to interactive mul- 
timedia systems where continuous display is required. 

9 Claimsj, 5 Drawing Sheets 
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ENCODER SYSTEM LEVEL BUFFER 
MANAGEMENT 

CROSS REFERENCE TO RELATED 

APPLlCAnONS 5 

This is a continuation of application Sen No. 08/565,698 
filed Nov. 30, 1995 now abandoned, 

BACKGROUND OF THE INVENTION 

10 

The present invention relates to the encoding of audio and 
video signals and in particular to producing segments of 
audio and video material that can be joined together on the 
fly. 

It is typically the case that, when two video clips arc 15 
required to be played one after the other, the decoder is 
required to reset itself to a starting state before decoding the 
second clip. During this reset, the user will be presented with 
the last frame of the first clip frozen on the screen, together 
with a mute of the accompanying audio. This discontinuity 20 
is inUiisive to the user. 

What is required is a seamless join in which the transition 
between the end of one clip and the start of the next is not 
noticeable to the decoder. This implies that from the user's 
point of view there is no perceptible change in the viewed 
frame rate and the audio continues uninterrupted. AppHca- 
tions for seamless video are numerous. An example from a 
CD-i perspective is the use of photo-real backgrounds for 
computer generated characters; an example use of this 
technique would be an animated character running in front 
of an MPEG coded video sequence. Another is a series of 
character-user interactions such as interactive movies where 
the viewer has the chance to influence development of the 
storyline by selecting from available alternative scenarios. 
Branch points along the path a user chooses to take through 
an interactive movie should appear seamless, otherwise the 
user will lose the suspension of disbelief normally associ- 
ated with watching a movie. 

A method for encoding segments of video such that 
re-initialisation of the decoder is not required is described in 
our UK Patent Application number 9424436,5 (PHB 33950) 
entitled "Video Editing Buffer Management" filed Dec. 2, 
1994. The method uses targeting of decoder buffer levels to 
give consistency of decoder buffer occupancy for the end of 
each video clip and predictability for the start of each 
segment such as to allow successive clips to be joined 
directly without risking overflow or underflow in the 
decoder buffer. 

The above method, whilst suitable for sequences of video 
frames, takes no account of the other information that wiU 
typically accompany it — ^for example an audio soundtrack. 
The video will generally be interleaved with other informa- 
tion into a single stream, referred to as the system stream, 
which constitutes the data that will be presented to a 
decoder/display device (for example the data stream read 
from a compact disc or delivered via a cable network). 

SUMMARY OF THE INVENTION 

It is therefore an object of the present invention to provide go 
for seamless joining of system stream segments whilst 
avoiding overflow or underflow in the decoders buffer. 

In accordance with the present invention there is provided 
a method for encoding of digital video signals, in the form 
of segments each comprising two or more pictures, and in an 65 
encoder apparatus having a coding stage, an encoder buffer, 
and means for interleaving the encoded video signal with at 



2 

least one further signal to form a system stream, the method 
comprising, for each segment, the steps of: successively 
encoding the pictures of the segment according to a prede- 
termined coding scheme; reading the encoded pictures into 
the buffer; reading the encoded segment out of the buffer at 
a substantially constant bit rate; and interleaving the buffer 
output signal into a system stream; characterised in that the 
encoder buffer occupancy is controUed to have a first pre- 
determined level at the moment the last picture of a first 
segment has been read into it, which occupancy level is 
related to a decoder buffer occupancy level, in that the time 
required to fiU a decoder buffer to the said decoder buffer 
occupancy level at the constant bit rate is derived, and in that 
a time stamp is inserted into the system stream of a follow- 
ing segment indicating that the first picture of that segment 
is to be read out of the decoder buffer at that time after 
loading of the said following segment commenced. 

As will be demonstrated hereinafter, the derivation of the 
instant at which a decoder should start to decode a received 
segment enables smooth joining of segments at the system 
stream level. Also, as the decoder buffer target level is a 
calculated level based on the encoder buffer contents, the 
conventional technique of having to fill the decoder buffer as 
full as possible before starting to decode is avoided. 

Preferably, a common decoder buffer level (and hence 
time stamp positioning) is provided. This may be achieved 
by setting a target number of bits used to encode pictures of 
the first segment and controllably varying the bit allocation 
per picture during encoding such as to reach the said encoder 
buffer occupancy level at the end of the segment. 

As wfll be described, in order to define the decoder buffer 
level, a maximum encoder buffer occupancy level is pref- 
erably specified, with the decoder buffer occupancy level 
being defined as the difference between the encoders maxi- 
mum level and the encoder buffer occupancy level one frame 
period after the last picture of the first segment has been read 
into the encoder buffer. 

Also in accordance with the present invention, there is 
provided a digital video signal encoder apparatus, config- 
ured for the encoding of image segments, where each 
segment comprises two or more pictures, the apparatus 
comprising: an encoding stage arranged to receive succes- 
sive pictures of a segment and encode them according to a 
predetermined coding scheme; a buffer coupled to receive 
successive encoded pictures from the encoding stage and 
arranged to output encoded segments at a substantially 
constant bit rate; and a multiplexer coupled to receive the 
buffer output and at least one further signal and arranged to 
interleave them into a system stream; characterised in that 
the encoding stage is operable to encode pictures of a 
segment in a controllably variable number of bits, the 
apparatus further comprising target setting means arranged 
to control the number of bits used by the encoder such as to 
produce a predetermined encoder buffer occupancy at the 
moment the last picture of a first segment is read into the 
buffer, and time stamp means operable to derive a timing 
value based on the said predetermined occupancy and insert 
a time stamp in the system stream of the following segment 
indicating that the first picture of the following segment is to 
be read out of a decoder buffer following a delay given by 
the timing value from the point that loading of said follow- 
ing segment into the decoder buffer commenced. 

The target setting means may suitably be arranged to 
control the number of bits per picture for the last K pictures 
of a segment as described above, and the encoding stage 
may suitably be arranged to provide zero-value bit stufBng 
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where less than the specified number of bits are required to 
code a picture. 

Further in accordance with the present invention there is 
provided a digital video image signal encoded by the method 
described above, together with an optical disc containing 
such a signal in the form of separately addressable segments. 

Further features and advantages of the present itivention 
will become apparent from reading of the following 
description, together with the attached claims, the disclosure 
of which is incorporated herein by reference. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Preferred embodiments will now be described by way of 
example only, and with reference to the accompanying 
drawings in which; 

FIG. 1 represents encoder and decoder video buffer con- 
tents at the conjunction of two video clips; 

FIG. 2 represents the relationship between decoder video 
and system buffer occupancies; 

FIGS. 3 to 5 represent decoder video and system buffer 
contents in a conventional system, and at the start and finish 
of a targeted clip, respectively; 

FIGS. 6 and 7 represent audio buffer contents at the start 
and finish of a targeted audio clip, respectively; and 

FIGS. 8 and 9 schematically represent encoder and 
decoder apparatus, respectively, 

DETAILED DESCRIPTION OF THE 
PREFERRED EMBODIMENT 

The following description relates to encoders and decod- 
ers operating in accordance with MPEG standards and 
conventional nomenclature from those standards will be 
used hereinafter. It will be recognised by the skilled prac- 
titioner however, that use of the present invention is not 
constrained to the MPEG standard. 

Any coding standard must be developed with models of 
how the encoder and decoder interface to one another. As an 
encoder runs it has to model what wiU happen in the decoder 
such that it never sends the decoder into an illegal state. 
Similarly, the decoder must support the same model the 
encoder used so that it remains in a legal state and produces 
the output the encoder intended. MPEG is no exception to 
this rule. 

Initially, it will be useful to consider the production of 
segments of purely video data which can be seamlessly 
joined, A preferred method is described in our above- 
mentioned UK Application number 9424436.5 (PHB 33950) 
and is summarised below with reference to FIG. 1 which 
shows graphs of buffer occupancy B against time t showing 
what happens to the encoder and decoder buffer states as one 
sequence of video frames (pictures) A ends and another B 
starts. LA indicates the last picture of sequence A; FB 
indicates the first picture of sequence B. The change of 
delivery data from sequence A to sequence B is shown by a 
change in thickness of the buffer occupancy line with the 
chain-linked line indicating pictures from sequence A. At 
some time t^ all the data for sequence A has been delivered 
and the VB V (Video Buffering Verifier— the MPEG decoder 
model) has an occupancy of B^ bits. From this time on all the 
data delivered to the buffer is for sequence B. It will be 
rioted, though, that some pictures from the end of sequence 
A are still in the buffer, which pictures are all removed by 
time t^ when the buffer has an occupancy of B, bits. 

The encoder goes through a process known as targeting 
when it tries to achieve a certain occupancy in the VBV 
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buffer. During targeting the encoder assumes that the VBV 
buffer has a certain target occupancy when the first picture 
it has coded is put into the buffer. This places an upper limit 
on the size of the first picture. At the end of a coding run the 
5 encoder targets the VBV occupancy at the point in time just 
before the first picture for the next sequence would be 
removed from the buffer, point B, in FIG. 1. It targets this 
state by changing the size of the last, or last few pictures, as 
it codes. 

10 In the example shown in FIG. 1 the encoder targets the 
state B,. This state represents the VBV buffer occupancy at 
the time jiist before the first picture of the new sequence is 
removed. Since, when an eacodcr runs, it generally has no 
direct contact with the decoder, it manages picture sizes 

15 based on its output buffer rather than transforming to and 
from the VBV buffer state. Accordingly, the following 
passages will refer to B,^ and B,^ (see the encoder occupancy 
values in the upper portion of FIG. 1). 

When targeting a start state the encoder assumes a certain 
occupancy in its buffer at the point when it introduces the 
first picture. This buffer occupancy is B^ bits which repre- 
sents the residual bits from the end of the previous sequence. 
The presence of these bits limits the maximum size of the 
first picture to be B^ bits and continues to have an effect on 
future picture sizes until all the bits have been removed after 
time if^. 

From the encoder's point of view start state targeting is 
very simple since all that is required is for it to set its initial 
occupancy to B^ bits rather than the usual start state of being 
empty. 

When the encoder approaches the end of a sequence it 
tries to target the point B,^ by forcing the size of the last 
picture to be such that when put into the buffer the occu- 

25 pancy will increase to B,^ bits. It could produce a very poor 
quality picture if a large degree of size fixing were applied 
to the last picture only. To overcome this, the encoder 
preferably has a target number of bits for the last GOP 
(Group of Pictures) and a target number of bits for each of 
the K pictures within the GOP, thus allowing the encoder to 
gradually approach the correct state. 

Turning now to the system layer, this is produced when 
multiplexers combine video, audio and other elementary 
streams into one system stream. The system stream contains 

45 interleaved elementary streams and timing information in 
the form of System Clock References (SCR's), Presentation 
Time Stamps (PTS's) and Decoding Time Stamps (DTS's). 

The multiplexer's job is to ensure that data is delivered 
from a system stream into the decoders' buffers fast enough 

50 so that they never completely empty and slow enough so that 
they never overflow. The MPEG standard defines a buffer 
called the System Target Decoder (STD) that is used to 
model the transfer of data from the Digital Storage Medium 
(DSM) into the decoder. The purpose of this buffer is to 

55 mask the fact that, because of data interleaving, dehvery into 
a decoder is in bursts but the decoder performance model 
assumes a constant bit rate. The usual strategy for multi- 
plexing video and audio streams typically starts by filling the 
STD buffer as full as possible. The SCR of the last pack 

60 made is then set so that the first picture is removed from the 
buffer at this time. Next, the SCRs of the previous packs are 
set to the correct values because they arrive ahead of this 
pack in time. Finally, the buffer is maintained as full as 
possible until all the data from the elementary stream has 

65 been consumed. 

The multiplexer operation is somewhat more complex 
than this because the multiplexer has to juggle with multiple 
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Streams. Also a multiplexer has to support any additional 
features that are needed to meet the specS&cation of a system 
stream as it has been defined for the target platform. 

The STD buffer can be thought of as enclosing the VBV 
buffer as shown in FIG. 2. The Figure represents a simplified 
view of the STD and VBV relationship. In FIG. 2 the STD 
buffer occupancy (thick line) is shown enclosing the VBV 
buffer. Apoml to note is that the slope of the STD buffer line 
is higher than that of the VBV Hne. This is because the bit 
rate of the DSM is higher than the coded video bit rate. It can 
also be seen that at times, indicated by reference Q, no data 
is delivered to the STD. This happens because either the 
bandwidth of the DSM is taken up by other elementary 
streams or the multiplexer has decided not to deliver data to 
the STD because of buffer management issues. 

FIG. 3 shows graphs of what happens to the VBV and 
STD occupancy at the start of a decoding a normal, i.e not 
targeted, stream. In the graph, for ease of understanding, the 
times (in terms of system ticks) at which pictures are 
removed from the two buffers are shown as in synchronism 
with each other. The VBV has no concept of data delivery 
start time, only of picture removal time based on the VBV 
delay. In order for the graphs to hne up, a constant offset was 
applied to the VBV sample times in the Figure. 

In FIG. 3 it can be seen that the multiplexer and the 
encoder have followed a similar strategy at the start of chp. 
That is to say, their buffers have been filled up as much as 
possible before the first picture is removed, as described 
above. Also it can be seen that, in this case, the STD buffer 
occupancy line always contains the VBV occupancy. The 
exception to this rule is during start-up because the data rate 
of the DSM is higher than that of the VBV so dehvery to the 
STD starts after the VBV. 

Turning now to the production of system streams that can 
be concatenated together, the produclioo of seamless sy.stem 
streams containing video will first be considered. Referring 
back to FIG. 1, it will be seen that the time taken for all the 
bits from the end of sequence A to be removed is dependent 
on how many pictures are in the buffer at the time when data 
delivery for sequence A stops. This time is dependent on the 
source material and the encoder used. Accordingly, the 
buffer occupancy (BJ at time t^ will vary from clip to clip. 
Therefore there is no way the multiplexer can calculate what 
t^ and B_^. would be for the general case based on the 
knowledge that is usually available to it, ie. bit rate, frame 
rate and what B^ was. 

To avoid the problem of all the unknowns described 
above, we use the fact that the data in the buffer when 
delivery of sequence A stops is sufBcient to cover the time 
it will take for constant rate delivery to deliver B, bits. We 
have recognised that this allows the setting of a maximum 
time for delivery of B, bits at the start of the multiplexed 
stream as: 

B. (1) 

where t, is as shown in FIG. 1, and R is the delivery rate. 

This time is important to the multiplexer because it allows 
the setting of the SCR time at this point, since this is the 
point when the first picture shoiild be removed from the STD 
buffer. Therefore, the multiplexer is set to fill the STD buffer 
to at least B, bits and set the SCRs so that at this time the first 
picture is removed. Note that this contrasts the conventional 
technique in which the buffer is filled as much as possible at 
which point the first picture is removed. 
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It should be pointed out here that the STD buffer is bigger 
than the VBV buffer. In the constrained system parameters 
case the STD buffer can be up to 46 kbytes compared to a 
maximum VBV size of 40 kbytes. This extra space allows 
5 the multiplexer to interleave different elementary streams 
into one system stream hiding the sporadic delivery to the 
decoder. 

FIG. 4 shows graphs of what happens in the VBV and 
STD buffer at the start of a targeted multiplexing operation. 

10 As with FIG. 3 the graph shows occupancy B (bits) against 
time t in the form of system ticks (clock pulses). It will be 
noted that the occupancy of the VBV and STD buffers is the 
same at the point when the first picture is removed from the 
buffer. From that point on the occupancy of the STD buffer 

15 starts to overtake that of the VBV buffer because the data 
delivery rate is higher. 

The multiplexer has far less control over the end state of 
the STD buffer. The end state is dependent on the size of the 
pictures still in the buffer which is controlled by the encoder. 

20 Preferably, the multiplexer attempts to dehver the data to the 
STD as soon as possible to guard against a problem known 
as SCR overlap: where data becomes due for a following 
clip when not all the data from the first clip has been 
delivered. A suggested multiplexer strategy for this is to 

25 make a data packet whenever the STD buffer can accept data 
no matter how small, 

FIG. 5 shows graphs of the VBV and STD states at the end 
of a cHp. It can be seen that delivery of data to the STD stops 
before the VBV because the STD buffer is larger and the 

30 received bit rate is higher. Towards the end of the clip the 
two graphs re-converge. This is the point in time when data 
delivery to both buffers has stopped and they contain only 
the pictures that make up the end of the clip. The time taken 
for these end pictures to be removed from the buffers is the 

35 time available for filling up the buffers with data from the 
next clip before the first picture of that clip is removed. This 
fill time is shown as time t, in FIG. 1. 

Turning now to audio signals, there is no equivalent of the 
VBV buffer for audio which makes the production of 

40 seamless audio much simpler than for video. The reason for 
this is because audio is coded at a constant instantaneous bit 
rate: in odier words, all similarly coded audio frames are the 
same size, to within one byte, whereas video frames vary in 
size depending on the frame type and complexity. It is, 

45 however, possible for the multiplexer to target an STD buffer 
slate at the time when the first audio frame is removed in a 
similar way to that of video. 

FIG. 6 shows a graph of the start of a targeted audio 
multiplexing operation. In this case the multiplexer targets 

50 the STD state at 2048 bytes (16384 bits) at the time when the 
first audio frame is removed. The fill time is analogous to t^ 
in FIG. 1 and has the similar property that the time must not 
be greater than the time represented by the audio frames in 
the buffer when data delivery stops. When the multiplexer 

55 has reached its target and the first audio frame has been 
removed it follows the strategy of keeping the STD buffer as 
full as possible. The large increasing slopes of the graph 
represent the delivery of an audio packet. In this case an 
audio packet is about 2 kbytes big because it was multi- 

60 plexcd for CD-i play- back with a CD-i sector about 2 kbytes 
big. 

FIG. 7 shows a graph of what happens in the STD at the 
end of an audio clip. The regular stepping down of occu- 
pancy representing the audio frames in the buffer begins at 
65 the time when data delivery stops. These residual audio 
frames will mask the change over from the end of one audio 
clip to the next. They last a finite time depending on the 
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number of frames but as can be seen by referring to FIG. 6 
the time to fill the STD buffer to its target is so short 
compared to the time masked by the residual frames that 
STD underflow should not occur, assuming that the DSM bit 
rate is substantially higher than the audio bit rate. Our 
experiments have shown that, for audio, targeting the STD 
buffer at about 50% fullness gives acceptable results. 

FIG. 8 shows an encoder apparatus having separate 
encoder stages 10, 12 for video V and audio A signals 
respectively. A further input for a frame reference signal FC 
identifying the frames within the video signal (such as by 
relative position with respect to the first and last pictures of 
a sequence) is connected to a target setting stage 14 which 
specifies the bit allocation for pictures of a sequence. It will 
be noted that there is no corresponding target stage for the 
audio encoder; this is not required due to the constant size 
of the encoded audio frames as described above. 

The output from each of the encoder stages 10, 12 is 
shown as being passed to respective buffers 16, 18 (although 
in practise these will form part of the encoder stages). A 
feedback path 20 from the output of the video encoder stage 
10 to the target setting unit 14 enables checking of whether 
the set targets have resulted in the required number of bits 
per picture in the encoded group. 

The buffered outputs are interleaved by a multiplexer 22, 
which also inserts the time stamps, to form a single data 
stream DS (the system stream) which may then be trans- 
mitted to a decoder or stored on a suitable medium, such as 
a compact disc. 

At the decoder (FIG. 9), a demultiplexer 24 separates the 
stream DS into video and audio channels by reference to a 
detector 26 monitoring the input stream. The separated video 
and audio streams are passed to respective buffers 28, 30 and 
from there to respective decoder stages 32, 34 to provide 
decoded output signals Voutj ^ut- Whilst the buffers 
would in practise form part of the respective decoder stages, 
they are shown separate to illustrate more clearly the STD 
model. 

From reading the present disclosure, other variations will 
be apparent to persons sldlled in the art. Such variations may 
involve other features which are already known in the 
methods and apparatiises for editing of audio and/or video 
signals and component parts thereof and which may be used 
instead of or in addition to features already described herein. 
Although claims have been formulated in this application to 
particular combinations of features, it should be imderstood 
that the scope of the disclosure of the present application 
also includes any novel feature or any novel combination of 
features disclosed herein either implicitly or explicitly or 
any generalisation thereof, whether or not it relates to the 
same invention as presently claimed in any claim and 
whether or not it mitigates any or all of the same technical 
problems as does the present invention. The applicants 
hereby give notice that new claims may be formulated to 
such features and/or combinations of such features during 
the prosecution of the present application or of any further 
application derived therefrom. 

We claim: 

1, A method for encoding digital video signals which are 
comprised of segments each comprising two or more pic- 
tures using an encoder apparatus which includes a coding 
stage, an encoder buffer, and means for interleaving the 
encoded video signal with at least one further signal to form 
a system stream, the method comprising, for each segment, 
the steps of: 

successively encoding the pictures of the segment accord- 
ing to a predetermined coding scheme, to thereby 
produce encoded pictures; 
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reading the encoded pictures into the encoder buffer; 

reading the encoded segment out of the encoder buffer at 
a substantially constant bit rate, to thereby produce a 
buffer output signal; 

interleaving the buffer output signal, to thereby produce 
the system stream; 

controlling an occupancy level of the encoder buffer in 
such a manner as to provide a predetermined encoder 
buffer occupancy level at a moment a last picture of a 
previous segment has been read into the encoder buffer, 
the predetennined encoder buffer occupancy level hav- 
ing a predetermined relationship to a prescribed occu- 
pancy level of a decoder buffer; 

determining a fiUing time required to fill the decoder 
buffer to the prescribed decoder buffer occupancy level 
at the substantially constant bit rate starting at a time 
that loading of the following segment into the decoder 
buffer is commenced; and 

inserting a time stamp into the system stream of a fol- 
lowing segment indicating a prescribed read-out time 
for a first picture of the following segment to be read 
out of the decoder buffer, wherein the prescribed read- 
out time is delayed from the time that the loading of the 
following segment into the decoder buffer is 
commenced, by a time delay equal to the determined 
filling time. 

2. The method as claimed in claim 1, further comprising 
the step of controUably varying a target number of bits iised 
to encode the pictures of the previous segment in a manner 
such as to achieve the predetermined encoder buffer occu- 
pancy level. 

3. The method as claimed in claim 1, wherein a maximum 
encoder buffer occupancy level is specified, and the pre- 
scribed decoder buffer occupancy level is defined as the 
difference between the maximum encoder buffer occupancy 
level and the predetermined encoder buffer occupancy level 
one frame period after the last picture of the previous 
segment has been read into the encoder buffer. 

4. The method as claimed in claim 2, wherein a respective 
target number of bits is specified for each of the last K 
picmres of the segment, where K is an integer. 

5. The method as claimed in claim 1, wherein the pictures 
of the segment are encoded in the successively encoding 
step according to an MPEG standard. 

6. A digital video signal encoder apparatus for encoding 
image segments each comprised of two or more pictures, the 
apparatus comprising: 

an encoding stage arranged to receive successive pictures 
of each segment and encode the successive pictures 
according to a predetermined coding scheme, to 
thereby produce successive encoded pictures; 

an encoder buffer coupled to receive the successive 
encoded pictures from the encoding stage and arranged 
to output encoded segments at a substantially constant 
bit rate, to thereby produce a buffer output; 

a multiplexer coupled to receive the buffer output and at 
least one further signal and arranged to interleave them 
into a system stream; 

wherein the encoding stage is operable to controUably 
vary the number of bits used to encode the pictures of 
each segment; 

target setting means arranged to control, for each 
segment, the number of bits used by the encoder to 
encode the picmres of that segment in a manner such as 
to produce a predetermined encoder buffer occupancy 
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level at a moment that a last picture of a previous encoded in respective numbers of bits such that, when the 

segment is read into the encoder buffer; encoded segment is read at a substantially constant bit rate 

time stamp means operable to derive, for each segment, a into a decoder buffer from which successive pictures are 

filling time required to fill a decoder buffer to a pre- removed for decoding at a real time display rate, a prede- 
scribed decoder buffer occupancy level at the substan- ^ termined decoder buffer occupancy level occurs at the 

tially constant bit rate starting at a time that loading of moment the data for the last picture of the segment has been 

a following segment is commenced md to insert a time read into the decoder buffer, and the system stream for each 

stamp in the system stream of the following segment segment includes a time stamp indicating a prescribed 

indicated a prescribed read-out time for a first picture of read-out time for a first picture of the following segment to 

the following segment to be read out of the decoder be read out of the decoder buffer, wherein the prescribed 

buffer, wherein the prescribed read-out time is delayed read-out time is delayed from the point following loading of 

from the time that the loading of the following segment that following segment into the decoder buffer by a time 

into the decoder buffer is commenced, by a time delay delay equal to a time required to fill the decoder buffer to the 

equal to the determined filling time. predetermined decoder buffer occupancy level starting at a 

7. The apparams as claimed in claim 6, wherein the target 15 time that loading of that following segment into the decoder 
setting means is operable to control, for each segment, the buffer is commenced, the time delay specified by the time 
number of bits used by the encoding stage to encode each stamp being related to the predetermined decoder buffer 
picture, for the last K pictures of that segment, where K is occupancy level. 

an integer. 9. An optical disc carrying a digital video image signal 

8. A digital video image signal which includes a plurality comprised of a plurality of encoded segments as claimed in 
of segments encoded by a method according to claim 1, each claim 8, wherein all segments provide a common predeter- 
scgment comprising a sequence of pictures encoded accord- mined decoder buffer occupancy level. 

ing to a predetermined coding scheme, wherein each of the 

last K pictures of a segment (where K is an integer) are * * * >^ * 
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